首页> 外文OA文献 >Symbolic Complexity for Nucleotide Sequences: A Sign of the Genome Structure
【2h】

Symbolic Complexity for Nucleotide Sequences: A Sign of the Genome Structure

机译:核苷酸序列的符号复杂性:基因组的标志   结构体

摘要

We introduce a method to estimate the complexity function of symbolicdynamical systems from a finite sequence of symbols. We test such complexityestimator on several symbolic dynamical systems whose complexity functions areknown exactly. We use this technique to estimate the complexity function forgenomes of several organisms under the assumption that a genome is a sequenceproduced by a (unknown) dynamical system. We show that the genome of severalorganisms share the property that their complexity functions behavesexponentially for words of small length $\ell$ ($0\leq \ell \leq 10$) andlinearly for word lengths in the range $11 \leq \ell \leq 50$. It is also foundthat the species which are phylogenetically close each other have similarcomplexity functions calculated from a sample of their corresponding codingregions.
机译:我们介绍一种从符号的有限序列估计符号动力学系统的复杂度函数的方法。我们在复杂度函数确切已知的几个符号动力学系统上测试这种复杂度估计器。在基因组是由(未知)动力学系统产生的序列的假设下,我们使用这种技术来估计几种生物的基因组的复杂度函数。我们显示了几种生物的基因组共享以下属性:对于长度为$ \ ell $($ 0 \ leq \ ell \ leq 10 $)的单词,其复杂度函数呈指数规律,对于范围为$ 11 \ leq \ ell \ leq 50的单词长度呈线性关系$。还发现,在系统发育上彼此靠近的物种具有从其相应编码区的样本计算出的相似复杂度函数。

著录项

  • 作者

    Salgado-Garcia, R.; Ugalde, E.;

  • 作者单位
  • 年度 2013
  • 总页数
  • 原文格式 PDF
  • 正文语种 {"code":"en","name":"English","id":9}
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号